Maximum-Likelihood Continuity Mapping (MALCOM): An Alternative to HMMs

نویسندگان

David A. Nix

John E. Hogden

چکیده

We describe Maximum-Likelihood Continuity Mapping (MALCOM), an alternative to hidden Markov models (HMMs) for processing sequence data such as speech. While HMMs have a discrete "hidden" space constrained by a fixed finite-automaton architecture, MALCOM has a continuous hidden space-a continuity map-that is constrained only by a smoothness requirement on paths through the space. MALCOM fits into the same probabilistic framework for speech recognition as HMMs, but it represents a more realistic model of the speech production process. To evaluate the extent to which MALCOM captures speech production information, we generated continuous speech continuity maps for three speakers and used the paths through them to predict measured speech articulator data. The median correlation between the MALCOM paths obtained from only the speech acoustics and articulator measurements was 0.77 on an independent test set not used to train MALCOM or the predictor. This unsupervised model achieved correlations over speakers and articulators only 0.02 to 0.15 lower than those obtained using an analogous supervised method which used articulatory measurements as well as acoustics ..

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Stochastic Articulatory-to-acoustic Mapping as a Basis for Speech Recognition

Hidden Markov models (HMMs) of speech acoustics are the current state-of-the-art in speech recognition, but these models bear little resemblance to the processes underlying speech production (Lee, 1989). In this respect, using an HMM to model speech acoustics is like using a Gaussian distribution to model data generated by a Poisson process – to the extent that the model is not an accurate repr...

متن کامل

Improving on Hidden Markov Models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding

The goal of the proposed research is to test a statistical model of speech recognition that incorporates the knowledge that speech is produced by relatively slow motions of the tongue, lips, and other speech articulators. This model is called Maximum Likelihood Continuity Mapping (Malcom). Many speech researchers believe that by using constraints imposed by articulator motions, we can improve o...

متن کامل

Maximum - likelihod adaptation of semi-continuous HMMs by latent variable decomposition of state distributions

Compared to fully-continuous HMMs, semi-continuous HMMs are more compact in size, require less data to train well and result in comparable recognition performance with much faster decoding speeds. Nevertheless, the use of semi-continuous HMMs in large vocabulary speech recognition systems has declined considerably in recent years. A significant factor that has contributed this is that systems t...

متن کامل

Maximum margin hidden Markov models for sequence classification

Discriminative learning methods are known to work well in pattern classification tasks and often show benefits compared to generative learning. This is particularly true in case of model mismatch, i.e. the model cannot represent the true data distribution. In this paper, we derive discriminative maximum margin learning for hidden Markov models (HMMs) with emission probabilities represented by G...

متن کامل

Quasi-Newton method for maximum likelihood estimation of hidden Markov models

Hidden Markov models (HMMs) are used in many signal processing applications including speech recognition, blind equalization of digital communications channels, etc. The most widely used method for maximum likelihood estimation of HMM parameters is the forward-backward (or BaumWelch) algorithm which is an early example of application of the Expectation-Maximization (EM) principle. In this contr...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Maximum-Likelihood Continuity Mapping (MALCOM): An Alternative to HMMs

نویسندگان

چکیده

منابع مشابه

A Stochastic Articulatory-to-acoustic Mapping as a Basis for Speech Recognition

Improving on Hidden Markov Models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding

Maximum - likelihod adaptation of semi-continuous HMMs by latent variable decomposition of state distributions

Maximum margin hidden Markov models for sequence classification

Quasi-Newton method for maximum likelihood estimation of hidden Markov models

عنوان ژورنال:

اشتراک گذاری